Scalable generalized median graph estimation and its manifold use in bioinformatics, clustering, classification, and indexing
Authors
Abstract
In this paper, we present GMG-BCU, a local search algorithm based on block coordinate update for estimating a generalized median graph for a given collection of labeled or unlabeled input graphs. Unlike all competitors, GMG-BCU is designed for both discrete and continuous label spaces and can be configured to run in linear time w.r.t. the size of the graph collection whenever node and edge labels are computable in linear time. These properties make GMG-BCU usable for applications such as differential microbiome data analysis, graph classification, clustering, and indexing. We also prove theoretical properties of generalized median graphs, namely, that they exist under reasonable assumptions which are met in almost all application scenarios, that they are in general non-unique, that they are NP-hard to compute and APX-hard to approximate, and that no polynomial α-approximation exists for any α unless the graph isomorphism problem is in P. Extensive experiments on six different datasets show that our heuristic GMG-BCU always outperforms the state of the art in terms of runtime or quality (on most datasets, in terms of both runtime and quality), that it is the only available heuristic that can cope with collections containing several thousands of graphs, and that it shows very promising potential when used for the aforementioned applications. GMG-BCU is freely available on GitHub: https://github.com/dbblumenthal/gedlib/.
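To make the object being estimated concrete: a generalized median graph minimizes the sum of distances to the input graphs over all graphs, whereas a set median is restricted to the inputs themselves. The following minimal sketch is not GMG-BCU and does not use GEDLIB. It assumes a toy setting in which all graphs share the same node identifiers (so no node alignment is needed), graphs are plain edge sets, and the distance is the number of edges in which two graphs differ. All function names are hypothetical. Under these assumptions the generalized median is simply the majority-vote edge set.

```python
def edge_distance(g, h):
    """Toy distance (an assumption): edges present in exactly one graph."""
    return len(g ^ h)


def sum_of_distances(candidate, graphs):
    """Objective minimized by a median graph."""
    return sum(edge_distance(candidate, g) for g in graphs)


def set_median(graphs):
    """Best candidate when the search is restricted to the inputs."""
    return min(graphs, key=lambda g: sum_of_distances(g, graphs))


def majority_median(graphs):
    """Generalized median for the toy distance: keep an edge if it
    occurs in more than half of the input graphs."""
    all_edges = set().union(*graphs)
    return frozenset(
        e for e in all_edges
        if 2 * sum(e in g for g in graphs) > len(graphs)
    )


# Three graphs on nodes 1..4; undirected edges stored as sorted pairs.
graphs = [
    frozenset({(1, 2), (2, 3), (1, 4)}),
    frozenset({(1, 2), (3, 4), (1, 3)}),
    frozenset({(2, 3), (3, 4), (2, 4)}),
]

print(sum_of_distances(set_median(graphs), graphs))       # 8
print(sum_of_distances(majority_median(graphs), graphs))  # 6
```

In this example the majority-vote graph is not one of the inputs and achieves a lower sum of distances than any input. With unaligned nodes and a graph edit distance, no such closed form is available, which is the setting the abstract addresses.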
Related resources
The clustering and classification data mining techniques in insurance fraud detection: the case of Iranian car insurance
Given the ever-growing spread of fraud in the insurance sector, particularly in automobile insurance, and its negative consequences for insurance companies, applying suitable and efficient methods to identify and detect fraud in this area is essential. Understanding the patterns in data on previously reported claims can help determine whether a damage claim is genuine or not. One of the most common and widely used ways of discovering patterns in data is the use of ...
Graph-Based k-Means Clustering: A Comparison of the Set Median versus the Generalized Median Graph
In this paper we propose the application of the generalized median graph in a graph-based k-means clustering algorithm. In the graph-based k-means algorithm, the centers of the clusters have traditionally been represented using the set median graph. We propose an approximate method for computing the generalized median graph that allows it to be used to represent the centers of the clusters. Exp...
Identifying the strategies Persian EFL learners use in reading an expository text in English and examining its relation to reading proficiency and motivation: a think-aloud study
The main aim of this study was to examine the type and extent of the strategies that Persian-speaking English majors used while reading an English text. The study also examined the differences in strategy use between readers with high and low levels of reading comprehension. The correlation between strategy use and reading comprehension on the one hand, and between strategy use and motivation on the other, was also tested in this research...
SIGNED GENERALIZED PETERSEN GRAPH AND ITS CHARACTERISTIC POLYNOMIAL
Let G^s be a signed graph, where G = (V, E) is the underlying simple graph and s: E(G) → {+, -} is the sign function on E(G). In this paper, we obtain the k-th signed spectral moment and the k-th signed Laplacian spectral moment of G^s, and we calculate the coefficients of its signed characteristic polynomial and signed Laplacian characteristic polynomial.
Riemannian Median and Its Estimation
In this paper, we define the geometric median of a probability measure on a Riemannian manifold, give its characterization, and state a natural condition that ensures its uniqueness. To calculate the median in practical cases, we also propose a subgradient algorithm, prove its convergence, and estimate the approximation error and the rate of convergence. The convergence property of...
Journal
Journal title: Information Systems
Year: 2021
ISSN: 0306-4379, 1873-6076
DOI: https://doi.org/10.1016/j.is.2021.101766